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We have investigated the potential energy surfaces for alanine chains consisting 
of three and six amino acids. For these molecules we have calculated potential en- 
ergy surfaces as a function of the Ramachandran angles ip and if), which are widely 
used for the characterization of the polypeptide chains. These particular degrees 
of freedom are essential for the characterization of proteins folding process. Calcu- 
lations have been carried out within ab initio theoretical framework based on the 
density functional theory and accounting for all the electrons in the system. We have 
determined stable conformations and calculated the energy barriers for transitions 
between them. Using a thermodynamic approach, we have estimated the times of 
characteristic transitions between these conformations. The results of our calcula- 
tions have been compared with those obtained by other theoretical methods and with 
the available experimental data extracted from the Protein Data Base. This com- 
parison demonstrates a reasonable correspondence of the most prominent minima 
on the calculated potential energy surfaces to the experimentally measured angles <p 
and tp for alanine chains appearing in native proteins. We have also investigated the 
influence of the secondary structure of polypeptide chains on the formation of the 
potential energy landscape. This analysis has been performed for the sheet and the 
helix conformations of chains of six amino acids. 



I. INTRODUCTION 



It is well known that proteins consist of amino acids whose number may vary in the 
range from hundreds up to tens of thousands. Small fragments of proteins are usually called 
polypeptide chains or polypeptides. This work is devoted to a study of the conformational 
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properties of alanine polypeptide chains. 

Since recently, it became possible to study experimentally small fragments ofproteins 
and po ly pept ld e s in the gaS phase the use of the MALDI m a SS spectroscopy |MQ 
and the ESI mass spectroscopy p, 10J- From theoretical viewpoint, investigation of small 
polypeptides is of significant interest because they can be treated by means of ab initio 
methods which allowing accurate comparison of theoretical predictions with experiment. 
The results of ab initio calculations can be then utilized for the development of model 
approaches applicable for the description of larger and more complex protein structures. 

Polypeptides are characterized by the primary and the secondary structure 0, 0, 0, • 
Different geometrical configurations of a polypeptide are often called as the conformations. 
One can expect that chemical and physical properties of various conformations of complex 
molecules might differ significantly. The number of various conformations (isomeric states) 
grows rapidly with the growth of a system size. Thus, a search for the most stable confor- 
mations becomes an increasingly difficult problem for large molecules. With the help of the 
NMR spectroscopy and the X-rays diffraction analysis it has been shown pj, that the sheet 
and the helix structures are the most prominent elements of the protein secondary structure. 

The main difference between the sheet and the helix structures is due to the difference of 
the dihedral angles formed by the atoms of the polypeptide chains in the two cases. These 
degrees of freedom are responsible for the transition of the molecule from one conformation 
to another. By increasing the temperature of the system, the degrees of freedom responsible 
for twisting of the polypeptide chain can be activated. The study of this transition and 
evaluation of its characteristic duration are of significant interest, because this problem is 
closely related to one of the most intriguing problems of the protein physics - the protein 
folding. To study this transition it is necessary to investigate the potential energy surface of 
amino acid chains with respect to their twisting. Besides the protein folding, the potential 
energy landscapes of polypeptides carry a lot of detail and useful information about the 
structure of these molecules. 

In the present paper we have studied the potential energy surfaces for small alanine 
chains. These molecules were chosen because they are often present in native proteins as 
fragments, and also because they allow ab initio theoretical treatment due to their relatively 
small size. 

Previously, only glycine and alanine dipeptides were studied in detail. Sometimes 
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their analogues were used to reduce _ the _ computational costs (for example, (S)-ct;- 
(formylamino)propanamide). In refs. 
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, 3, 13 alanine and glycine dipeptides were inves- 
tigated within the Hartree-Fock theory. In these papers the potential energy surfaces were 
calculated versus the twisting angles of the molecules. Different stable states of the dipep- 
tides, corresponding to different molecular conformations, were determined. Each stable 
state of the molecule was additionally studied on the basis of the perturbation theor y w hich 
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takes into account many-electron correlations in the system. In refs. 
different conformations and their energies were determined within the framework of the den- 
Slty actional theor, „ re , Q d_ of the alanine dipeptide a nalog was d 1S ee SS ed 
and the time of the transitions between the two conformations of the alanine dipeptide was 
found. 
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A number of papers were devoted to the study of tripeptides. In refs. |20 
dynamics of the alanine and glycine tripeptides was studied by means of classical molecular 
dynamics and with the use of semi-empirical potentials (such as GROMOS, CHARMM and 



AMBER). In |25j within the framework of the Hartree-Fock theory several stable conforma- 
tions of alanine and glycine tripeptides were found. In ref. 2f| the Raman and IR spectra 
for alanine and glycine tripeptides were measured in neutral, acidy and alkali environments. 

Polypeptides have been studied less. We are aware of only several related papers. In 
particular, stable conformations of neutral and charged alanine hexapeptides were obtained 
with the use of empirical potentials and discussed in ref. j^. Experimental NMR study of 
various conformations of alanine heptapeptides at different temperatures was carried out in 
ref. In ref. [3| with the use of empirical molecular dynamics based on Monte- Carlo 

methods, a polypeptide consisting of 21 amino acids was described. 

In the present paper we have performed an ab initio calculation of the multidimensional 
potential energy surface for the alanine polypeptide chains consisting of three and six amino 
acids. The potential energy surface versus twisting degrees of freedom of the polypeptide 
chain has been calculated. The calculations have been performed within ab initio theoretical 
framework based on the density functional theory (DFT) accounting for all the electrons in 
the system. Previously, this kind of calculations were performed only for dipeptides (see, 
eg-, QlllQ)- 

For larger molecules, only a few conformations were considered (see cita- 
tions above). We have calculated the energy barriers for the transitions between different 
conformations and determined the energetically most favorable ones. Using a thermody- 
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namic approach, we have estimated times of the characteristic transitions between the most 
energetically favorable conformations. The results of our calculation have been compared 
with other theoretical simulations and with the available experimental data. We have also 
analyzed how the secondary structure of polypeptide chains influences the potential energy 
landscapes. In particular, the role of the secondary structure in the formation of stable 
conformations of the chains of six amino acids being in the sheet and in the helix conforma- 
tions has been elucidated. Some preliminary results of our work were published as electronic 
preprints jsoi I3I | . 

Our paper is organized as follows. In section |H] we give a brief overview of theoretical 
methods used in our work. In section IIHI we present and discuss the results of our com- 
putations. In section IIVI we draw a conclusion to this paper. The atomic system of units, 
|e| = m e — h — 1, is used throughout the paper unless other units are indicated. 



II. THEORETICAL METHODS 



In the present paper we study the multidimensional potential energy surfaces for alanine 
polypeptides within the framework of the density functional theory. The potential energy 
surfaces are multidimensional functions of atomic coordinates. In our work the potential 
energy surfaces are considered as a function of the dihedral angles formed by the atoms of 
the polypeptide chain. For this calculation the Born-Oppenheimer approximation allowing 
to separate the motion of the electronic and ionic subsystems is used. 

The density functional theory (DFT) is a common tool for the calculating of properties 
of quantum many body systems in which many electron correlations play an important role. 
The DFT formalism is well known and can be found in many textbooks (see e.g. [3, 
Therefore in our work we present only the basic equations and ideas of this method. 

Electronic wave functions and energy levels within the framework of DFT are obtained 



from the Kohn-Sham equations, which read as (see e.g. 32I l33j|): 



where the first term represents the kinetic energy of the z'-th electron with the wavefunction 
ipi and the energy Si, Ui ons describes the electron attraction to the ionic centers, Vh is the 
Hartree part of the interelectronic interaction ref. j^j], V xc is the local exchange-correlation 
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potential. 

The exchange-correlation potential is defined as a functional derivative of the exchange- 
correlation energy functional: 

_ SE xc [p] 
5p{r) 

Equation (J2J) is exact and follows from the Hohenberg theory ref . |35[ . However, no unique 
potential E xc , universally applicable for different systems and conditions, has been found so 
far. 

Approximate functionals employed by the DFT usually partition the exchange-correlation 
energy into two parts, referred to as the exchange and the correlation terms: 



E xc [p\ = E x (p) + E c {p) 



(3) 



Both terms are the functionals of the electron density, which can be of two distinctly different 
types: either a local functionals depending only on the electron density p, or a gradient- 
corrected functionals depending on both p and its gradient, Vp. A variety of exchange 
correlation functionals can be found in literature. In our work we have used the hybrid 
Beche-t yP e th.ee.p_ exchange Actional ref. paired with the gradient—d 
Lee, Yang and Parr correlation functional (B3LYP) refs. [3? . 38]. 



III. RESULTS AND DISCUSSION 



A. Determination of the polypeptide twisting degrees of freedom 

In this section we present the potential energy surfaces for the glycine polypeptide chains 
calculated versus dihedral angles ip and if) defined in figure HJ In particular, we focus on the 
chains consisting of three and six amino acids. 

Both angles are defined by the four neighboring atoms in the polypeptide chain. The angle 
<Pi is defined as the dihedral angle between the planes formed by the atoms (C^_ x — Aj — Cf) 
and (Aj — Cf — CJ . The angle ipi is defined as the dihedral angle between the (A^ — Cf — 
and (Cf — C\ — N i+1 ) planes. Beside the angles ifi and ipi there is an angle which is 
defined as the dihedral angle between (Cf — C\ — N i+1 ) and (C[ — N i+ i — Cf +1 ) planes. The 
atoms are numbered from the NH2— terminal of the polypeptide. The angles (fi, ipi and 
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FIG. 1: Dihedral angles 99 and ^ used to characterize the potential energy surface of the polypeptide 
chain. 

u)i take all possible values within the interval [—180°; 180°]. For the unambiguous definition 
we count the angles ipi, ipi and clockwise, if one looks on the molecule from its NH 2 — 
terminal (see fig. Q). This way of angle counting is the most commonly used [lfl ]. 

The angles ipi and ipi can be defined for any amino acid in the chain, except the first and 
the last ones. Below we omit the subscripts and consider angles ip and ifi for the middle 
amino acid of the polypeptide. 

B. Optimized geometries of alanine polypeptides 

In order to study twisting of a polypeptide chain one needs first to define its initial 
structure. Although, the number of its conformations increases with the growth of the 
molecule size, there are certain types of polypeptide structure, namely the sheet and the 
helix conformations, which are the most typical. In the present paper we have investigated 
twisting of the polypeptide chains of the sheet and the helix conformations. By varying the 
angles <p and ip in the central amino acid one can create the structure of the polypeptide 
differing significantly from the pure sheet or helix conformations. If the structure of a 
polypeptide can be transformed to a helix or a sheet one by a trivial variation of tp and ip, 
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such polypeptides for the sake of simplicity are referred below as belonging to the group of 
the helix or the sheet structure, respectively. 




E=-818.4665 E=-1560.5108 E=-1560.5022 



FIG. 2: Optimized geometries of alanine polypeptide chains calculated by the B3LYP/6- 
31++G(d,p) method: a) Alanine tripeptide; b) Alanine hexapeptide (sheet conformation); c) 
Alanine hexapeptide (helix conformation). 

In figure El we present the optimized geometries of alanine polypeptide chains that have 
been used for the exploration of the potential energy surfaces. All geometries were optimized 
with the use of the B3LYP functional. Figure |2K shows the alanine tripeptide structure. In 
the present work we choose the sheet conformation, because the tripeptide is too short to 
form the helix conformation. Figures EJd and |2t show alanine hexapeptide in the sheet and 
the helix conformations, respectively. The total energies (in atomic units)of the molecules 
are given below the images. 

C. Polypeptide energy dependance on the dihedral angle u 

For each amino acid there are only three dihedral angles formed by atoms of the polypep- 
tide chain which describe its twisting. The angle u (see fig. differs from the angles ip 
and ^, because C\ atom has the sp 2 hybridization state, what leads to formation of a quasi- 
double bond between C i and N i+ i atoms. Therefore, the angle to is often referred as a "stiff" 
degree of freedom, whose value depends only slightly on both the polypeptide constituent 
amino acids and the values of other degrees of freedom. To illustrate this fact, in figure El we 
present the energy dependencies on u calculated for alanine dipeptide with different values 
of angles ip and ip in the cenral amino acid. 

From this figure it is clear that there are two stable states in the system with u = 0° and 
uj = 180° which do not depend on the angles <p and ip. The heights of the barriers between 
these states are weakly depend on ip and ip, being equal to ~1 eV=23.06 kcal/mol. 
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FIG. 3: Dependance of alanine tripeptides energy on angle u calculated by the B3LYP/6-31G(d) 
method at different values of angles (p and ip. 

The calculation shows that at temperatures close to the room temperature, the value of 
the angle u changes insignificantly. The potential energy surface as a function of the angles 
(p and ifi appears to be much more complex as it is shown in the next section. 



D. Potential energy surface for alanine tripeptide 

In figure 0] we present the potential energy surface for the alanine tripeptide calculated 
by the B3LYP/6-31G(2d,p) method. The energy scale is given in eV, kcal/mol and Kelvin. 
Energies on the plot are measured from the lowest energy minimum of the potential energy 
surface. 

From the figure follows that there are several minima on the potential energy surface. 
They are numbered according to the value of the corresponding energy value. Each min- 
imum corresponds to a certain conformation of the molecule. These conformations differ 
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FIG. 4: Potential energy surface for the alanine tripeptide calculated by the B3LYP/6-31G(2d,p) 
method. Energies are given in eV, kcal/mol and Kelvin. Numbers mark energy minima on the 
potential energy surface. Arrows show transition paths between different conformations of the 
molecule. 

significantly from each other. In the case of alanine tripeptide there are six conformations, 
shown in figure Dashed lines show the strongest hydrogen bonds in the system, which arise 
when the distance between hydrogen and oxygen atoms becomes less then 2.9 angstroms. 

To calculate the potential energy surface the following procedure was adopted. Once the 
stable structure of the molecule has been determined and optimized, all but two (these are 
the angles tp and if) in the central amino acid) degrees of freedom were frozen. Then the 
energy of the molecule was calculated by varying if and if>. This procedure was used to 
calculate all potential energy surfaces presented below in this section. It allows one to find 
efficiently the minima on the energy surface and to determine the main stable conformations 
of the molecule. The absolute energy values of different conformations of the tripeptide 
found by this method are not too accurate, because the method does not account for the 
relaxation of other degrees of freedom in the system. To calculate the potential energy 
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E =-818.4469 (a.u.) 



AE =-0.052 (0.026) 



AE=0.162 (0.280) 




qi— 157.4 (-156,8) 
y/= 166,2 (163.6) 

4 

AE =0.203 (0.283) 




*i 2.426 



<p=- 166.9 (-166.1) 
\|/=-52.1 (-54.9) 



cp=-82.3 (-82.7) 
j(f=6S,3 (75.6) 



AE=0.Q18 (0.299) 
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i]j=-60.5 (-53.1) 



ip=64.7 (53.5) 
y=30.5 (37.1) 



AE =0.077 (0.342) 
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FIG. 5: Optimized conformations of the alanine tripeptide. Different geometries correspond to 
different minima on the potential energy surface (see contour plot in figure 0J. Below each image 
we present angles (p and tp, which have been obtained with accounting for relaxation of all degrees 
of freedom in the system. Values in brackets give the angles calculated without accounting for 
relaxation. Above each image the energy of the corresponding conformation is given in eV. The 
energies are counted from the energy of conformation 1 (the energy of conformation 1 is given in 
a.u.). Values in parentheses correspond to the energies obtained without relaxation of all degrees 
of freedom in the system. Dashed lines show the strongest hydrogen bonds. Their lengths are 
given in angstroms. 
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surface with accounting for the relaxation one needs 20-30 times more of the computer time. 
Therefore, a calculations with accounting for the relaxation have not been performed in our 
work. Instead, we have performed a complete optimization of the molecular conformations, 
corresponding to all minima on the calculated potential energy surface. 

In figureElwe compare stable conformations of the alanine tripeptide calculated with and 
without accounting for the relaxation of all atoms in the system. As it is seen from this 
figure the angles <p and ip differ by about 10 percent in the two cases. This difference arises 
due to the coupling of tp and ip with other degrees of freedom. Note the change of the sign of 
the relative energies of some conformations. This effect is due to the rearrangement of side 
atoms (radicals) in the polypeptide chain which lowers the energies of different conformations 
differently. 

In our work the potential energy surface has been calculated and interpolated on the grid 
with the step of 18°. This step size is an optimal one, because the interpolation error is 
about 9°, i.e. comparable with the angle deviations caused by the relaxation of all degrees 
of freedom in the system. 

Note that for the alanine tripeptide an additional maximum appears at 93 = 120° ± 50°, 
ip = 30° ±30°, while it is absent on the potential energy surface for the glycine tripeptide |3(|. 
This maximum is a result of overlapping of the side CH 3 - radicals, which are substituted in 
the case of the glycine polypeptide with the H- atoms. 



TABLE I: Comparison of dihedral angles ip and ifi corresponding to different conformations of 
alanine tripeptide. 



Conformation 




ipUH 








f 


ip 


1 


-168.4 


170.5 


-157.2 


159.8 




-157.4 


166.2 


2 






-60.7 


-40.7 




-82.3 


-68.3 


3 


63.8 


32.7 


67.0 


30.2 




64.7 


30.5 


4 












-166.9 


-52.1 


5 


74.1 


-57.3 


76.0 


-55.4 




72.0 


-60.5 


6 


-128.0 


29.7 


-130.9 


22.3 




-119.1 


13.6 
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57.9 


-136.3 



In ref. |ll| and ref. |l2j several stable conformations were found for alanine and glycine 
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dipeptides. The values of angles ip and ip for the stable conformations of dipeptide and 
tripeptide are close indicating that the third amino acid in tripeptide makes relatively small 
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uence on the values of dihedral angles of two other amino acids. In earlier papers refs. 



Ill |12| dipeptides were studied within the framework of the Hartree-Fock theory. In ref. 
11 1, values of if and ip were obtained by the HF/6-31+G* method, and in ref. by 
HF/6-31G**. In table U we compare the results of our calculation for tripeptide with the 
corresponding data obtained for dipeptides. Some discrepancy between the values presented 
is due to the difference between the dipeptide and tripeptide (i.e. the third alanine in the 
tripeptide affects the values of angles tp and ip). However, another source of discrepancy 
might arise due to accounting for the many-electron correlations in the DFT and neglecting 
this effect in the Hartree-Fock theory used in refs. 

HQ. 
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FIG. 6: Transition barriers for between conformations 1 <-> 2 of the alanine tripeptide. Circles and 
squares correspond to the barriers calculated without and with relaxation of all degrees of freedom 
in the system. 



Figure 0] shows that some domains of the potential energy surface, where the potential 
energy of the molecule increases significantly, appear to be unfavorable for the formation 
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of a stable molecular configuration. The growth of energy takes place when some atoms in 
the polypeptide chain approach each other at small distances. Accounting for the molecule 
relaxation results in the decrease of the system energy in such cases, but the resulting 
molecular configurations remain unstable. We call such domains on the potential energy 
surface as forbidden ones. In figure |U one can identify two forbidden regions in the vicinity 
of the points (0, 0) and (0, 180). At (0, 0)a pair of hydrogen and oxygen atoms approach 
to the distances much smaller than the characteristic H — O bond length. This leads to a 
strong interatomic repulsion caused by the exchange interaction of electrons. At (0, 180) 
the Coulomb repulsion of pair of oxygen atoms causes the similar effect. 

Figure |U shows that there are six minima on the potential energy surface for alanine 
tripeptide. The transition barrier between the conformations 1 <-> 2 is shown in figure H3 
The barrier has been calculated with and without relaxation of the atoms in the system. 
The corresponding transition path is marked in figure 0] by an arrow. This comparison 
demonstrates that accounting for the relaxation significantly lowers the barrier height and 
influences the relative value of energy of the minima. 

Let us now estimate the time needed for a system for the transition from one conformation 
to another. To do this we use the Arhenius equation, which reads as: 

- = fte"^ (4) 
r 

where r is the transition time, Q is the factor, determining how frequently the system 
approaches the barrier, AE is the barrier height, T is the temperature of the system, k is 
the Bolzmann factor. 

Figure U\ shows the transition barrier between two main conformations of the alanine 
dipeptide analog ((S)-a-(formylamino)propanamide). It is seen that AE\^ 2 = 0.047 eV for 
the transition 1 — ► 2, while AE 2 ^i = 0.079 eV for the transition 2 —>■ 1. The frequency 
Q for this molecule is equal to 42.87 cm -1 . Thus, at T = 300 K, we obtain T^^ia ~ 5 
ps and T^\ la ~ 17 ps. This result is in excellent agreement with the molecular dynamics 
simulations results obtained in ref. predicting r ~ 7 ps for the transition 1 — > 2 and 
t ~ 19 ps for the transition 2 — >• 1. This comparison demonstrates that our method is 
reliable enough and it can be used for the estimation of transition times between various 
conformations of the polypeptides. 

Using the B3LYP/6-31G(2d,p) method we have calculated the frequencies of normal 
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FIG. 7: Transition barriers for between conformations 1 *-* 2 of alanine dipeptide analog calculated 
by the B3LYP/6-31+G(2d,p) method accounting for the relaxation of all degrees of freedom in the 
system. Structure of the conformations 1 and 2 is shown near each minimum. 

vibration modes for the alanine tripeptide. The characteristic frequency corresponding to 
twisting of the polypeptide chain is equal to 32.04 cm -1 . From figure El follows that AEi_> 2 = 
0.066 eV for the transition 1 — >• 2 and Ai^-a = 0.114 eV for the transition 2 — >• 1. Thus, 
we obtain r^J la ~ 13 ps and r|^ a ~ 86 ps. Let us note that these transition times can be 
measured experimentally by means of NMR refs. jloL lo^t . 

E. Potential energy surface for alanine hexapeptide with the sheet and the helix 

secondary structure 

In figure|Slwe present contour plots of the potential energy surface for the alanine hexapep- 
tide with the sheet (part a) and the helix (part b) secondary structure respectively versus 
dihedral angles ip and ip. In both cases the forbidden regions arise because of the repulsion 
of oxygen and hydrogen atoms analogously to the alanine tripeptide case. 

Minima 1-6 on the potential energy surface |Ht correspond to different conformations of 
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the alanine hexapeptide with the sheet secondary structure. Note that these minima are 
also present on the potential energy surface of the alanine tripeptide (see fig. 0J). Geometries 
of the conformations 1-6 are shown on the right-hand side of figure IHt- 

Energy barrier as a function of a scan variable (see figure IHk) for the transition between 
conformations 1 and 2 is shown in figure El The energy dependence has been calculated with 
and without relaxation of all the atoms in the system. In the case of alanine hexapeptide 
with the sheet secondary structure the barrier height for the transition 1 — > 2 is significantly 
higher than for the transition 2 — > 1, being equal to 0.095 eV and 0.023 eV, respectively. The 
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FIG. 8: Potential energy surface for the alanine hexapeptide with the sheet secondary structure 
(part a) and with the helix secondary structure (part b) calculated by the B3LYP/6-31G(2d,p) 
method. Energy scale is given in figure 0J Numbers mark energy minima on the potential en- 
ergy surface. Images of optimized conformations of the alanine hexapeptide are shown near the 
corresponding energy landscape. Values of angles ip and if), as well as the relative energies of the 
conformations are given analogously to that in figure 
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FIG. 9: Transitions barriers between conformations 1 «-» 2 of the alanine hexapeptide with the 
sheet secondary structure. Circles and squares correspond to the barriers calculated without and 
with relaxation of all degrees of freedom in the system. 

normal vibration mode frequency, corresponding to the twisting of the polypeptide chain is 
equal to 6.24 cm' 1 and was calculated with the B3LYP/STO-3G method. Using equation 
(HJ) one derives the transition times at room temperature: T^Q ly ~ 211 ps, Tq^*q 1v ~ 13 ps. 

Let us now consider alanine hexapeptide with the helix secondary structure. The potential 
energy surface for this polypeptide is shown in figure |Hb- The positions of minima on this 
surface are shifted significantly compared to the cases discussed above. This change takes 
place because of the influence of the secondary structure of the polypeptide on the potential 
energy surface. The geometries of the most stable conformations are shown on the right-hand 
side of figure |Hb- 

For the alanine hexapeptide with the helix secondary structure there is a maximum at 
ip ~ 180° and ip ~ 40° in addition to the central maxima on the potential energy surface. 
This maximum appears because of the repulsive interaction of the outermost amino acids 
side radicals. 

It is worth noting that for some conformations of alanine hexapeptide the angles <p and 
■0 change significantly when the relaxation of all degrees of freedom in the system account- 
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ing for (see for example conformations 1, 5 in fig. |Sk and conformations 2, 4 in fig. |Hb)- 
This means that the potential energy surface of the alanine hexapeptide in the vicinity of 
the mentioned minima is very sensitive to the relaxation of all degrees of freedom. How- 
ever, calculation of the potential energy surface with accounting for the relaxation of all 
degrees of freedom is unfeasible task. Indeed, one needs about 2000 hours of computer time 
(Pentium Xeon 2.4 GHz) for the calculation of the potential energy surface for the alanine 
hexapeptide. To perform an analogues calculation with accounting for the relaxation about 
5 years of computer time would be needed. Nevertheless, the potential energy surface cal- 
culated without accounting for the relaxation carries a lot of useful information. Thus, one 
can predetermine stable conformations of polypeptide, which then can be used as starting 
configurations for further energy minimization. 



F. Comparison of calculation results with experimental data 

Nowadays, the structure of many proteins has been determined experimentally ref. 
Knowing the protein structure one can find the angles and for each amino acid in the 
protein. 

In figure ITOk we show a map of the allowed and forbidden conformations for alanine 
residues in poly-alanine chain taken from ref. j^] (steric Ramachandran diagram) . This map 
was obtained from pure geometrical considerations, in which the structure of the polypeptide 
was assumed to be fixed and defined by the interatomic van der Waals interaction radii. De- 
pending on the distances between the atoms one could distinguish three regions: completely 
allowed, conventionally allowed and forbidden. The conformation is called completely al- 
lowed if all the distances between atoms of different amino acids are larger than some critical 
value Tij > r max . Conventionally allowed regions on the potential energy surface correspond 
to the conformations of the polypeptide, in which the distances between some atoms of 
different amino acids lie within the interval r m « n < < r max . All other conformations are 
referred to as forbidden. The values of r min and r max are defined by the types of interact- 
ing atoms and can be found in the textbooks (see, e.g., j^). In figure ITUk we mark the 
completely allowed regions with white, the conventionally allowed regions with light gray 
and the forbidden regions with dark gray color. In this figure we mark the points, which 
correspond to the geometries of alanine, whose periodical iteration leads to the formation 
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FIG. 10: Comparison of angles <p and ip of alanine residues in protein structures selected from 
the Brookhaven Protein Data Bank [j], with the steric diagram for poly-alanine j^lj (part 
a)). Comparison of angles ip and j) of alanine residues in protein structures selected from the 
Brookhaven Protein Data Bank with the minima on the calculated potential energy surfaces 

for: alanine tripeptide (b); alanine hexapeptide in sheet conformation (c); alanine hexapeptide in 
helix conformation (d). Transparent rhomboids correspond to alanines surrounded with alanines, 
while filled circles correspond to alanines surrounded by other amino acids. Dashed ellipses mark 
the regions of higher concentration of the observed angles. 



of chains with specific secondary structure. In table |H] we compile the values of angles ip 
and ip, which correspond to the most prominent poly-alanine secondary structures. For the 
illustrative purposes we mark these points by white circles with the corresponding type of 
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the secondary structure typed in. Thus, 2^, 2j are the right-handed and the left-handed 
2 7 helix; 3^,, 3f are the right-handed and the left-handed 3i helix; or, are the right- 
handed and the left-handed a— helix (4i 3 ); 7Tr, ttl are the right-handed and the left-handed 
Ti— helix (5i6); tt? Tl are the parallel and antiparallel sheets. Pi, (3n correspond to the 
j3— turns of types I and II respectively. 



TABLE II: Angles p and tj) corresponding to the most prominent poly-alanine secondary structures. 



Structure type 


P (Deg.) 


ip (Deg.) 


right-handed (left-handed) 2-j helix 


-78 (78) 


59 (-59) 


right-handed (left-handed) 3io helix 


-49 (49) 


-26 (26) 


right-handed (left-handed) a— helix (4i3) 


-57 (57) 


-47 (47) 


right-handed (left-handed) tt— helix (5xe ) 


-57 (57) 


-70 (70) 


parallel (5 sheet (||) 


-119 


113 


antiparallel (5 sheet (jj) 


-139 


135 


(3— turn of type I 


-90 





(3— turn of type II 


90 






Note that not all of the structures listed above are present equally in proteins. In figure 
ITUk we show the distribution of the angles ip and ip of alanine residues in protein structures 
selected from the Brookhaven Protein Data Bank P, (^(J . It is possible to distinguish four 
main regions, in which most of experimental points are located. In figure El these regions are 
schematically shown with dashed ellipses. Note, that these ellipses are used for illustrative 
purposes only, and serve for a better understanding of the experimental data. The regions 
in which most of the observed angles p and ip are located correspond to different secondary 
structures of the poly-alanine. Thus, region I corresponds to the parallel and antiparallel 
(3— sheets. Region II corresponds to the right-handed 2^ helix. Region III corresponds to the 
right-handed a^— helix, right-handed ttr— helix, right-handed 3j^ helix and /3— turn of type 
I. Region IV corresponds to the left-handed a^— helix, right-handed ttl— helix, left-handed 
3fo helix and (3— turn of type II. In some cases there are several types of secondary structure 
within one domain. In the present work we have not studied the secondary structure of 
proteins systematically enough to establish the univocal correspondence of the observed 
experimental points to different types of the secondary structure. 
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Let us now compare the distribution of angles tp and ip experimentally observed for pro- 
teins with the potential energy landscape calculated for alanine polypeptides and establish 
correspondence of the secondary structure of the calculated conformations with the predic- 
tions of the simple Ramachandran model. 

Region I corresponds to the minimum 1 on the both potential energy surfaces of the 
alanine tripeptide (fig. ITUb) and the alanine hexpeptide with the secondary structure of 
sheet (fig. ITUb). These conformations correspond exactly to the alanine chains in the (3- 
sheet conformation (see fig. and Ek) . Note that there is no minimum in that region of the 
potential energy surface for alanine hexapeptide with the secondary structure of helix (see 

fig. nnn). 

Region II corresponds to the minimum 2 on the both potential energy surfaces ITUb and 
ITUb . as well as to the minimum 3 on the potential energy surface HUH. On the steric diagram 
for poly-alanine this region corresponds to the right-handed 2f helix. The structure of 
conformations 2 on the surfaces ITUb and ITUh differs from the structure of this particular 
helix type. Only the central alanines, for which the angles tp and x/j in figures ITUb and ITUb 
are defined, have the structure of 2f helix. Thus, one can refer to the conformations 2 as 
to the mixed states, where the central part of the polypeptide chain has the conformation 
of helix and the outermost parts have the conformation of sheet. Conformation 3 on the 
surface ITUH is also a mixed state. Here one can distinguish one turn of 3f helix and two 
turns of 2^ helix (see fig. |Hb). 

Region III corresponds to the structure of right-handed or— helix, right-handed 3f helix, 
right-handed ttr— helix and (3— turn. It corresponds to minima 6, 5 and 4 on the potential 
energy surfaces ITUb. ITUb and ITUH respectively. Conformation 6 can not be assigned to any 
specific type of secondary structure because the chain is too short. Note, that conformation 
6 is even not a stable one on the potential energy surface of the alanine tripeptide. The 
most probable types of secondary structures in that region of the potential energy surface are 
right-handed or— helix and /3— turn. However, for the formation of a single turn of ck_r— helix 
(or for the formation of (3— turn) at least four amino acids are needed. Conformation 5 on 
the potential energy surface of the alanine hexapeptide can be characterized as a partially 
formed (3— turn because the alanine, for which the dihedral angles ip and ip in figure ITUb 
are defined has the geometry of (3— turn, but its neighbor forms a (3— sheet (see fig. UtO- 
Conformation 4 on the potential energy surface |Sb changes significantly after accounting for 
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the relaxation of all degrees of freedom in the system, and gets outside the region III. In 
this conformation one can locate fragments of right-handed 2^ and 3^ helixes. The point 
corresponding to the minimum 4 (after accounting for the relaxation) lies outside regions II 
and III because angles ip and if) in figure ITUH are defined for the amino acid between two 
helix fragments. 

Region IV is represented by the structure of left-handed oil— helix, left-handed 3f helix, 
left-handed 7r^— helix and /?— turn of type II. The fragments with those types of secondary 
structures are very rare met in native proteins. To form these structures it is necessary to 
have at least four amino acids, therefore minima 3 on the potential energy surface for alanine 
tripeptide can not be compared to any type of the mentioned secondary structures. Region 
IV corresponds to the conformations 3 and 2 on the surfaces ITUb and ITUH respectively. 
Conformation 3 on the surface ITUb corresponds to partially formed (3— turn, because the 
alanine, for which the dihedral angles tp and tp in figure ITUh are plotted has the configuration 
of j3— turn but the neighboring amino acid in the polypeptide chain forms (3— sheet (see fig. 
IHK)- Conformation 2 on the potential energy surface HUH lies outside the region IV, but 
accounting for the relaxation of all degrees of freedom shifts the minimum on the potential 
energy surface to the allowed region of left-handed a^— and 3f helix (see fig. |Ht>)- The 
geometry of conformation 2 is similar to the geometry of left-handed 3f helix (see fig. Eh)- 
The main differences in the structure are caused by the insufficient length of the polypeptide 
chain to form a regular helix structure. 



IV. CONCLUSION 



In the present paper the multidimensional potential energy surfaces for amino acid chains 
consisting of three and six alanines has been investigated and the conformational properties 
of these systems with respect to the twisting of the polypeptide chain have been described. 
The calculations have been carried out within ah initio theoretical framework based on 
the density functional theory and accounting for all the electrons in the system. We have 
determined stable conformations and calculated the energy barriers for transitions between 
them. Using a thermodynamic approach, we have estimated times of the characteristic 
transitions between the conformations. It was demonstrated that the transition times lie 
within the picosecond region. Our estimates are compared with the available molecular- 
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dynamics simulations results, and the correspondence between the results of the two different 
methods is reported. A strong barrier asymmetry between neighboring stable conformations 
on the potential energy surface was found. 

We compared for the first time values of angles tp and ip for alanine residues experi- 
mentally observed in real proteins with the coordinates of minima on the potential energy 
surfaces. This comparison showed that all profound minima on the potential energy surfaces 
correspond to the regions in which experimentally measured values of if and tp are located. 
We have also analysed how the secondary structure of polypeptide chains influences the 
formation of the potential energy landscapes. For the chains of six amino acids with the 
secondary structures of sheet and helix the influence of the secondary structure on the stable 
conformations of the molecule has been demonstrated. 

The results of this work can be utilized for modeling more complex molecular systems. 
For example, the suggested model for for the estimation of the characteristic transition times 
can be used for longer polypeptide chains, also consisting of different amino acids and for 
estimates of time of proteins folding. It is also possible to use the results of the present work 
for testing the applicability and accuracy of different model approaches for the polypeptide 
description requiring much less computer time than ab initio calculations. 
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